Finding approximate gene clusters with Gecko 3

نویسندگان

  • Sascha Winter
  • Katharina Jahn
  • Stefanie Wehner
  • Leon Kuchenbecker
  • Manja Marz
  • Jens Stoye
  • Sebastian Böcker
چکیده

Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The changing tails of a novel short interspersed element in Aedes aegypti: genomic evidence for slippage retrotransposition and the relationship between 3' tandem repeats and the poly(dA) tail.

A novel family of tRNA-related SINEs named gecko was discovered in the yellow fever mosquito, Aedes aegypti. Approximately 7200 copies of gecko were distributed in the A. aegypti genome with a significant bias toward A + T-rich regions. The 3' end of gecko is similar in sequence and identical in secondary structure to the 3' end of MosquI, a non-LTR retrotransposon in A. aegypti. Nine conserved...

متن کامل

Staying sticky: contact self-cleaning of gecko-inspired adhesives.

The exceptionally adhesive foot of the gecko remains clean in dirty environments by shedding contaminants with each step. Synthetic gecko-inspired adhesives have achieved similar attachment strengths to the gecko on smooth surfaces, but the process of contact self-cleaning has yet to be effectively demonstrated. Here, we present the first gecko-inspired adhesive that has matched both the attach...

متن کامل

Modeling Stimulus-Frequency Otoacoustic Emissions in the Gecko

Although lizards lack the basilar-membrane traveling waves evident in mammals, their ears produce stimulus-frequency otoacoustic emissions (SFOAEs) with latencies comparable to those measured in many mammals (1–2 ms or greater). To probe the origin of these relatively long OAE delays, we developed a model of SFOAE generation in the gecko. The model inner ear comprises a collection of linear, co...

متن کامل

Coherent reflection without traveling waves: on the origin of long-latency otoacoustic emissions in lizards.

Lizard ears produce otoacoustic emissions with characteristics often strikingly reminiscent of those measured in mammals. The similarity of their emissions is surprising, given that lizards and mammals manifest major differences in aspects of inner ear morphology and function believed to be relevant to emission generation. For example, lizards such as the gecko evidently lack traveling waves al...

متن کامل

Learning States and Rules for Time Series Anomaly Detections

The normal operation of a device can be characterized in different temporal states. To identify these states, we introduce a clustering algorithm called Gecko that can determine a reasonable number of clusters using our proposed L method. We then use the RIPPER classification algorithm to describe these states in logical rules. Finally, transitional logic between the states is added to create a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 44  شماره 

صفحات  -

تاریخ انتشار 2016